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FOREWORD 

This  paper  represents  the  theoretical  basis  for  a  research  project  currently 
being  conducted  by  the  Aviation  Research  Laboratory  of  the  Institute  of  Aviation, 
University  of  Illinois  at  Urbana-Champaign.  This  research  is  being  supported  by 
the  Life  Sciences  Program,  Air  Force  Office  of  Scientific  Research.  This  project  is 
one  of  eight  tasks,  four  of  which  are  in  the  area  of  general  pilot  selection,  training, 
and  performance  and  four  of  which  deal  with  avionics  system  design  principles. 
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INTRODUCTION 

Although  Response  Surface  Methodology  (RSM)  was  originally  developed 
by  Box  and  Wilson  (1951)  for  use  in  chemical  research,  its  utility  in  psychological 
research,  especially  in  studies  of  human  performance,  also  has  been  documented 
(Meyer,  1963;  Simon,  1970;  Williges  &  Simon,  1971).  RSM  is  a  tool  for  assessing 
the  effect  of  continuous,  quantitative  factors  upon  some  response  of  interest  to 
the  Investigator.  Unlike  tradirio"  al  factorial,  analysis  of  variance  designs,  RSM 
focuses  primarily  on  the  functional  relationship  which  exists  between  the  response 
and  the  factors,  rather  than  merely  determining  the  significance  of  various  factors. 
Indeed,  if  one's  aim  is  to  determine  the  quantitative  relationship  between  human 
performance  and  a  host  of  equipment  parameters,  the  tradirional  methods  fall  short. 

In  a  recent  application  of  RSM  to  a  target  recognition  experiment,  Simon  (1970) 
noted  several  direct  benefits  which  accrue  from  a  RSM  regression  equation  approx¬ 
imating  the  response  surface.  Such  equations  allow  one  to  estimate  performance 
levels  at  given  levels  of  system  variables,  to  estimate  necessary  levels  of  system 
variables  given  a  designated  performance  level,  to  determine  the  tradeoffs  among 
system  parameters  while  optimizing  performance  level,  and  to  judge  the  relative 

importance  of  the  various  system  parameters. 

In  addition  to  providing  an  approximation  of  the  relationship  between  perfor¬ 
mance  and  factors,  RSM  advances  a  variety  of  experimental  designs  to  achieve  that 
estimate  as  efficiently  and  as  economically  as  possible.  When  using  factorial 
designs,  the  investigator  is  often  forced  by  practical  considerations  to  limit 
the  number  of  factors  studied  to  even  less  than  the  number  that  he  believes 
has  a  critical  effect  on  performance.  In  such  a  case  he  must  conduct  multiple 
studios  yielding  isolated  investigations  of  only  a  few  factors  at  any  one  time.  This 
results  in  an  unrealistic  view  of  a  system  in  which  a  number  of  factors  operate 
simultaneously  to  influence  performance.  By  allowing  the  investigator  to  consider 
large  numbers  of  factors  within  a  single  study,  RSM  proves  a  valuable  investigatory 
tool .  Through  strategic  sampling  of  data  points,  RSM  provides  the  most  essential 
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information  and  allows  one  to  decide  whether  or  not  the  collection  of  additional 
data  is  merited. 

Most  RSM  designs  are  special  cases  of  the  Box  end  Wilson  (1951)  central - 
composite  design.  Because  these  designs  were  originally  developed  for  chemical 
research,  it  is  not  unreasonable  to  anticipate  the  need  for  some  modification  in 
their  basic  design  to  make  the  method  more  adaptable  to  research  involving  human 
subjects.  The  purpose  of  this  paper  is  to  suggest  several  appropriate  design 
modifications  that  attempt  to  retain  as  many  of  the  positive  traits  of  the  RSM 
technique  as  possible.  Before  discussing  these  modifications,  a  brief  description 
of  central -composite  designs  is  necessary. 

CENTRAL-COMPOSITE  DESIGNS 

By  way  of  example,  suppose  that  an  investigator  is  interested  in  predicting 
radar  target  detection,  Y,  as  a  function  of  display  resolution,  ,  visual  angle, 
and  random  noise,  X^.  When  developing  an  equation  to  approximate  the 
relationship  between  a  given  response  and  a  set  of  experimental  variables,  an 
investigator  generally  measures  the  desired  response  at  relatively  few  data  points, 
each  defined  by  some  unique  combination  of  experimental  variable  levels.  From 
the  resulting  data  he  then  makes  inferences  about  the  nature  of  the  underlying 
relationship.  The  basic  central-composite  design  details  a  strategy  to  be  followed 
in  selecting  these  data  collection  points  that  insures  orthogonality  of  factors 
while  substantially  limiting  the  required  number  of  observations.  The  identifying 
feature  of  a  central -composite  design  is  that  all  the  data  collection  points  are 
spaced  symmetrically  about  and  at  a  specified  distance  from  an  arbitrary  center  point 
that  is  determined  by  the  experimental  region  of  greatest  interest  to  the  investigator. 
The  data  points  can  be  thought  of  as  sampling  three  portions  of  the  overall  ex- 
perimental  region.  In  a  K-factor  design,  a  2  factorial  (cr  fractional  factorial) 
is  combined  with  a  2K  star  plus  an  appropriate  number  of  center  points.  For  the 
hypothetical  case  of  K=3  outlined  above,  the  design  can  be  diagrammed  as  in 


-3- 


Figure  1  .  Eight  unique  data  points  lie  at  the  vertices  of  a  cube;  6  other  unique 
data  points  form  a  six  arm  star.,  radiating  from  yet  a  fifteenth  unique  point  at  the 
center  of  the  cube  and  extending  beyond  the  surfaces  of  the  cube. 

For  each  factor  under  consideration,  five  levels  of  it  must  occur.  Further¬ 
more,  these  five  levels  must  correspond  to  the  coded  values  0,  ±1,  and  ±a, 
whereof  represents  fhe  length  of  the  star  arm  or  the  radius  of  the  hypersphere  on 
which  data  are  located.  The  appropriate  real  world  values  for  each  factor  are  determined 
by  the  range  of  values  in  which  one  is  interested  and  by  the  constraints  placed  upon  them 
by  the  required  coded  values.  For  example,  suppose  that  display  resolution  ranges  in 
value  from  168  to  504  TV  lines/dm  and  that  these  two  limiting  values  are  subse¬ 
quently  assigned  coded  values  of  -  a  =  -  1  .682  and  Of  =  +  ]  .682,  respectively.  Then  the 

X1  -  336 

corresponding  transformation  —  dictates  that  the  other  levels  of 

X^  be  236  ,  336  ,  436,  which  correspond  to  the  coded  values  -1,0,  and  1 , 
respectively.  Although  coding  the  appropriate  raw  score,  independent  variable 
levels  is  not  necessary,  the  use  of  linear  transformations  of  the  data  permits  a 
simpler  analysis  without  affecting  the  result  of  any  subsequent  statistical  test. 

\ 

On  occasion  this  rigid  demand  regarding  the  selection  of  data  points  makes  the 
central -composite  design  impractical  for  some  human  factors  studies.  For  example, 
variables  such  as  target  type,  target  complexity,  and  briefing  instructions  are  not 
readily  quantifiable.  Moreover,  it  is  sometimes  neither  practical  nor  feasible  to 
measure  even  certain  quantifiable  variables  at  the  five  levels  specified  by  the 
central -composite  design. 

Replication 

The  original  central-composite  design  calls  for  repeated  observations  to  be 
made  at  the  designated  center  point  only.  As  noted  by  Cochran  and  Cox  (1957), 
these  replicated  center  points  provide  additional  degrees  of  freedom  to  permit 
an  estimate  of  experimental  error,  and  they  determine  the  precision  of  the  estimate 
at  and  near  the  center.  Choosing  the  appropriate  number  of  center  points  results 
in  a  design  in  which  the  standard  error  of  estimate  is  roughly  the  same  at  all 
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points  within  the  experimental  region.  Hence,  the  estimate  of  error  computed  at 
the  center  can  be  used  as  an  estimate  of  error  throughout  the  experimental  K -space, 
thereby  minimizing  redundancy.  Too  many  replications  at  the  center  yield  standard 
errors  of  estimate  which  increase  rapidly  for  those  points  farther  from  the  center.  On 
the  other  hand,  with  too  few  replications  of  the  center  point,  the  standard  error  is 
apt  to  be  greater  at  the  center  than  at  the  surrounding  data  points. 

Rotatability 

Another  desirable  property  of  many  central-composite  designs  is  rotatability 

(Box  and  Hunter,  1957).  Rotatability  insures  equal  reliability  of  predicted  responses 

at  all  data  points  equidistant  from  the  center.  This  is  an  especially  convenient 

design  quality  in  exploratory  work  when  the  investigator  is  ignorant  of  the  response 

surface  and  its  relative  orientation  to  the  orthogonal  factor  axes.  Rotatability 

K/4 

imposes  the  additional  constraint  that  the  value  of  a  be  equal  to  2  '  .  As  the 

K 

number  of  factors  increases  ,  fractional  replicates  of  the  full  2  factorial  are  used, 
thereby  reducing  the  overall  size  of  the  experiment.  In  general,  when  fractional 
factorial  designs  of  (1/2)^  are  used  in  place  of  the  full  factorial,  then  0!  must  equal 
2  (k  p)/4  rofatobi I ity  is  to  exist  (Cochran  and  Cox,  1957).  Thus,  if  the  hypothetical 
three  factor  design  diagrammed  in  Figure  1  is  to  be  rotatable,  the  star  arm  must  be 
1  .682  (coded  value).  To  insure  roughly  equal  precision  of  prediction  across  the 
entire  experimental  region,- the  center  point  is  replicated  6  times.  When  complete, 
the  design  involves  a  total  of  20  observations  with  14  of  the  experimental  points 
lying  on  the  surface  of  a  sphere  of  radius  1 .682,  and  with  six  observations  being 
made  at  the  center  point  (0,  0,  0). 

Blocking  ; 

An  additional  feature  of  RSM  designs  that  affords  the  investigator  greater 
efficiency  and  flexibility  is  blocking.  Under  blocking  conditions  subsets  of  the 
total  data  collection  points  are  studied  together.  If  the  blocking  is  orthogonal, 
any  differences  in  mean  performance  among  blocks  are  independent  of  any  main 
effects  due  to  the  independent  variable  manipulations,  and  as  such,  they  do  not 


affect  the  underlying  quantitative  relationship  between  factors  and  performance. 

This  aspect  of  design  is  valuable  to  the  human  factors  engineer  who  is  concerned  with 
isolating  potential  effects  due  to  such  factors  as  different  experimenters,  changes  in 
apparatus,  and  environmental  conditions.  Recall  the  investigator  studying  radar  target 
detection  latency  as  affected  by  display  resolution,  visual  angle,  and  random  noise. 

It  is  unlikely  that  all  the  necessary  data  can  be  collected  during  a  single  flight  or 
perhaps  not  even  in  the  same  aircraft.  By  taking  advantage  of  orthogonal  blocking 
techniques,  he  can  guard  against  the  parameters  of  the  RSM  prediction  equation 
being  affected  by  such  differences. 

Orthogonal  blocking  places  additional  constraints  on  rhe  central -composite 
design  concerning  the  selection  of  Cl  and  the  number  of  center  points.  These  para¬ 
meters  must  be  chosen  to  insure  that  the  average  predicted  response  level  is  the  same 
for  every  block.  Orthogonal  blocking  is  guaranteed  when  the  following  condition 
is  met  (Box  and  Hunter,  1957): 

2K/2«2  =  (N  +  N  )/(N  +  N  )  (1 ) 

c  CO  s  so 

where  N  and  N  are  the  number  of  center  points  added  to  the  intact  2 
co  so  r 

factorial  portion  and  the  2  K  star  portion  of  the  design,  respectively.  Nc  and  N 

K  S 

reflect  the  number  of  noncenter  points  in  the  2  factorial  and  in  the  2K  star, 
respectively.  Given  the  proposed  design  in  Figure  1  for  studying  detection  latency, 
orthogonal  blocking  can  be  achieved  by  dividing  the  20  data  points  into  subsets 
of  6,  6,  and  8  observations,  as  depicted  in  Figure  2.  The  corresponding  value  of 
a  is  1  .633. 

23/2a2  =(8+4)/(6+2) 
a  =  1 .633 

Recalculating  the  real  world  levels  of  display  resolution  using  this  new  a  value, 
the  investigator  learns  that  the  levels  appropriate  to  the  orthogonally  blocked 
design  are  173  ,  236,  336  ,  436,  499.  He  must  also  recompute  the  appropriate 
real  world  levels  of  visual  angle  and  random  noise  in  a  like  manner.  Note  that 


BLOCK  1 

BLOCK  2 

BLOCK  3 

(  1,  -1,  1  ) 

(“1.  I.-I  ) 

(-1.633,0,0) 

(  1,  1,  'I  ) 

(-1,-1,  1  ) 

(0,-1.633,  0) 

("I,  M) 

(  1 , -l.-l ) 

(0,  0,-1.633) 

(-l.-l.-l  ) 

(  1,  1,1  ) 

(  1.633,  0,  0  ) 

(  0,0,  0  ) 

(0,0,0) 

(0,1.633,  0) 

(0,0,0  ) 

(0,0,0) 

(  0,  0,  1.633) 

(0,  0,0  ) 

(0,  0,0) 

Figure  2.  Orthogonal  blocking  of  second-order  central-composite  design  in  three 
variables  with  coded  value  coordinates  of  data  points. 


the  vdue  of  a  required  to  insure  orthogonality  is  slightly  different  from  the  1 .682 
value  required  for  rotatability.  In  order  to  achieve  orthogonal  blocking,  it  is 
often  necessary  to  sacrifice  rotatability.  In  human  factors  applications,  however, 
the  potential  gains  from  orthogonal  blocking  probably  outweigh  the  risk  of  forfeiting 
rotatability. 

Added  flexibility  ccn  jccrue  from  use  of  blocking  techniques,  as  Box  and 
Hunter  (1957)  illustrated  when  they  employed  blocking  to  facilitate  exploration 
of  a  response  surface.  A  properly  blocked  design  permits  research  to  be  conducted 
in  stages.  Each  block  of  data  points  from  the  complete  second-order  design 
constitutes  a  first-order,  rotatable  central-composite  design.  Gathering  data  from 
the  first  series  of  blocks, the  investigator  can  judge,  for  example,  whether  any  of 
the  original  experimental  variables  merits  being  dropped  from  further  consideration, 
or  whether  greater  than  a  linear  polynomial  is  needed  to  explain  adequately  the 
data.  The  ability  to  make  such  decisions  at  an  early  stage  may  mean  that  the 
investigator  is  able  to  conclude  his  study  after  collection  of  considerably  less  data 
than  he  had  originally  anticipated. 


Analyses 

Given  data  from  a  central-composite  RSM  design,  the  analyses  of  interest 

are: 

1)  Multiple  regression  analysis  to  determine  a  prediction  equation  which 
approximates  the  functional  relationship  between  the  desired  response 
and  the  experimental  factors.  Par  the  case  of  three  independent 
variables,  a  general,  complete  second-order  polynomial  equation  would 
be  as  follows: 

Y=bo+b,XI+b2X2+b3X3+fc4X12+b5X22+  (2) 

b6X32+b/lX2+b8XlX3+b9X2X3 
where  Y  is  the  resulting  response  level,  Xj,  X and  X^  are  levels 
of  the  three  independent  variables,  and  the  b.'s  are  constant  values 
known  os  partial  regression  coefficients.  The  multiple  regression 
analysis  determines  the  numerical  values  of  those  partial  regression 
coefficients  b  ,  ...,  bn  according  to  a  least  squares  criterion. 


-9- 


2)  Analysis  of  vorionce  (ANOVA)  performed  on  the  results  of  the  preceding 

regression  analysis.  The  ANOVA  determines  what  proportion  of  the 

total  variance  in  real  world  response  is  explained  by  the  prediction 

equation;  this  estimate  of  the  accuracy  of  the  prediction  equation  is 

2 

represented  by  the  coefficient  of  determination,  R  .  The  ANOVA  also 
determines  the  significance  of  each  partial  regression  coefficient, 
indicating  whether  or  not  the  corresponding  variable  is  significant  in 
determining  the  response  level .  In  addition,  the  ANOVA  can  test  for 
the  significance  of  blocking,  inter-subject  differences,  and  lack  of  fit. 

A  significant  lack  of  fit  term  indicates  that  a  higher  order  prediction 
equation,  or  a  more  complete  equation  of  the  seme  order,  is  required  in 
order  to  represent  the  data  adequately. 

Consider  again  the  study  of  radar  target  detection  latency,  Y,  as  a 
function  of  display  resolution,  visual  angle,  and  random  noise,  X^,  X^r  and  X^, 
respectively.  Hypothetical  data  for  such  a  study  are  presented  in  Table  1 .  A 
multiple  regression  analysis  of  these  hypothetical  data  yields  the  following  generalized, 
first  order  prediction  equation: 

Y  =  16.115  -  1 .203  X]  -  0.503  X2  +  0.847  X^ 

Substituting  given  levels  of  the  independent  variables  into  this  equation  affords 
the  investigator  a  corresponding  predicted  level  of  detection  latency. 

The  results  of  a  subsequent  ANOVA  performed  on  the  regression  analysis 
appear  in  Table  2.  The  derived  equation  accounts  for  nearly  74%  of  the  total 
variance  in  detection  latency.  Each  of  the  coefficients,  excluding  the  constant 
term  bQ,  is  significant  at  well  beyond  the  .01  level.  But,  blocks  are  not  signifi¬ 
cant.  Noting  that  the  lack  of  fit  term  is  significant,  the  investigator  will  submit 
his  data  to  a  second  multiple  regression  analysis  to  determine  a  higher  order 
prediction  equation. 
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TABLE  1 

Hypothetical  Data  in  Coded  Form  for  a  Three-Factor, 
Central  -Composite  RSM  Design 


Observation 

Block 

x, 

Resolution 

x2 

Visual  Angle 

X3 

Random  Noise 

Detection  Latency 
(Seconds) 

1 

1 

1.0 

-1 .0 

1.0 

16.2 

2 

1 

1.0 

1.0 

-1.0 

14.3 

3 

1 

-1.0 

1.0 

1.0 

17.0 

4 

1 

-1.0 

-1.0 

-1.0 

17.4 

5 

1 

0.0 

0.0 

0.0 

15.5 

6 

1 

0.0 

0.0 

0.0 

15.8 

7 

2 

-1.0 

1.0 

-1 .0 

16.8 

8 

2 

-1.0 

-1.0 

1.0 

18.1 

9 

2 

1.0 

-1 .0 

-1.0 

14.9 

10 

2 

1.0 

1.0 

1.0 

16.2 

11 

2 

0.0 

0.0 

0.0 

15.0 

12 

2 

0.0 

0.0 

0.0 

14.8 

13 

3 

-1 .63 

0.0 

0.0 

19.0 

14 

3 

0.0 

-1 .63 

0.0 

17.3 

15 

3 

0.0 

0.0 

-1 .63 

14.8 

16 

3 

1.63 

0.0 

0.0 

13.9 

17 

3 

0.0 

1.63 

0.0 

14.6 

18 

3 

0.0 

0.0 

1.63 

19.2 

19 

3 

0.0 

0.0 

0.0 

15.8 

20 

3 

0.0 

0.0 

0.0 

15.7 
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TABLE  2 

First-Order  Regression  Analysis  of  Variance  Summary  Tcble 
for  Hypothetical  Detection  Latency  Data 


Source 

Percent 

df 

Sum  of  Squares 

Mean  Square 

Regression 

(73.92) 

(  3) 

32.18 

10.73 

fai 

44.25 

1 

19.26 

19.26 

fa2 

7.75 

1 

3.37 

3.37 

b3 

21.93 

1 

9.54 

9.54 

Residual 

(26.08) 

06) 

11.35 

0.71 

Blocks 

0.94 

2 

0.41 

0.21 

Lock  of  Fit 

24.9 7 

11 

10.87 

0.99 

Error 

0.16 

3 

0.07 

0.02 

Total 

100.00 

19 

43.53 

Multiple  Regression  Coefficient  = 

0.86 

*  £  <  .05 

***p<.001 

Coefficient  of  Determination  = 

0.74 

F -Value 

536.50  *** 
963.00  *** 

168.51  *** 
477.00  *** 

10.50* 
49.50  *** 
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DESIGN  CONSIDERATIONS 

In  a  recent  crticle,  Williges  and  Simon  (1971)  discussed  several  general 
odvcnicges  of  the  RSM  technique  which  contribute  to  its  potential  value  in  human 
rectors  research.  Among  the  most  obvious  benefits  is  the  economy  of  data  collection. 
Not  only  is  sampling  restricted  to  the  experimenfcl  region  of  greatest  interest,  but 
also  repeated  observations  ere  restricted  to  the  center  pain*  of  that  region.  A 
originally  conceived,  RSM  was  developed  as  a  methodology  for  quickly  locating 
optimums  by  means  of  o  series  of  experiments  each  dependent  on  the  results  of  the 
preceding  one.  More  specifically.  Box  end  Wilson  0951)  were  interested  in 
determining  the  optimum  combination  of  feefor  levels  needed  to  produce  the  maximum 
yield  from  o  chemical  reaction.  However,  human  factors  engineers  ere  largely 
interested  in  deriving  global  prediction  equations  which  allow/  them  to  predict 
performance  levels  accurately  throughout  an  entire  range  of  factor  levels. 

When  the  goal  is  to  approximate  cn  entire  response  surfcce,  rather  them 
merely  that  portion  of  the  surfcce  surrounding  the  optimum,  limiting  multiple 
observations  to  a  single  experimental  point  may  not  be  the  most  judicious  strategy. 
Indeed,  the  actual  variability  in  response  may  be  so  great  across  subjects  and  data 
points,  that  to  presume  the  standard  error  of  estimate  ct  the  center  point  is  an 
adequate  estimate  of  error  at  all  points  is  unrealistic,  A  recent  study  concerning 
transfer  of  trainirg  (Baron  end  Willices,  1971)  affords  a  striking  demonstration  of 
the  effect  of  estimating  experimental  error  ct  a  single  replicated  point  as  opposed 
to  estimating  it  across  a  series  of  replicated  points.  When  replications  were 
restricted  to  the  center  point,  none  of  the  experimental  factors  were  found  to 
contribute  significantly  to  the  response  level,  despite  their  heavy  weightings  in 
the  resulting  prediction  equation.  When  multiple  observations  were  made  at  each 
of  the  data  points,  however,  the  subsequent  analysis  revealed  that  all  of  the 
experimental  variables  were  significant  in  determining  'he  response  level .  Of 
course,  when  the  basic  central-composite  design  is  modified  in  such  a  manner,  meth¬ 
odological  questions  arise  concerning  how  best  to  retain  the  positive  attributes  of 
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the  basic  design,  while  still  nicking  the  modifications  cppropriote  :o  research  with 
human  subjects.  For  example,  should  repeated  observations  be  mode  at  more  than 
one  experimental  poirt;  should  all  data  be  retained  or  should  they  be  collapsed; 
should  different  subjects  be  observed  at  each  experimental  point  or  should  the 
seme  subjects  be  observed  oil  points;  under  whet  conditions  ore  particular  design 
variations  especially  appropriate? 

The  following  discussion  proposes  severe!  design  variations  appropriate 
to  human  factors  research  together  with  the  ensuing  methodological  considerations. 

A  generalized  computer  program  to  analyze  data  from  each  of  these  design  variations 
cs  well  as  ccia  from  the  bcsic  central -composite  design  has  been  developed  by 
Clerk,  WJlices,  and  Comer  <1971). 

Collapsed  Designs 

The  simplest  modification  is  achieved  merely  by  replicating  the  entire 
central -composite  design  o  given  number  of  times.  Consider,  for  example,  the 
orthogonally  blocked,  central-composite  design  depicted  in  Figure  2.  Suppose 
the  investigator  elects  to  replicate  that  design  five  times.  The  data  points  remain 
the  same  as  those  listed  under  Figure  2.  Now,  however,  the  design  involves  a 
total  of  100  observations,  over  a  total  of  15  treatment  conditions.  Block  1  now 
contains  30  observations.  Block  2  contains  30  observations,  end  Block  3  contains 
40  observations.  Note  that,  although  multiple  observations  have  been  made  at 
each  of  the  experimental  points,  the  center  point  has  still  been  replicated  six 
times  more  then  any  other  point.  While  the  points  on  the  surface  of  the  sphere 
have  been  replicated  five  times,  the  center  point  has  been  replicated  30  times, 

10  times  within  each  of  the  three  blocks. 

At  this  point  the  investigator  must  decide  whether  or  not  to  retain  and 
analyze  directly  the  data  corresponding  to  all  100  observations.  He  could  collapse 
his  data  across  those  subjects  within  the  same  block  who  were  observed  at  the  same 
experimental  point  and,  then  analyze  the  collapsed  data.  The  net  effect  of 
collapsing  in  this  manner  is  a  data  matrix  identical  in  form  and  number  of  observa¬ 
tions  to  one  resulting  from  the  original  blocked  central -composite  design  shown  in 


Figure  2.  Now,  however,  the  data  ere  confined  vclues  obtained  from  collapsing 
rather  than  data  representing  a  simple  observation.  In  addition,  estimates  of  experi¬ 
mental  error  ere  obtained  from  the  resulting  six  center  poinrs,  each  of  which  is  a 
collapsed  score. 

This  procedure  has  been  used  successfully  by  Mueller  and  Simon  f 1 969 r  in 
a  target  recognition  experiment.  It  has  the  advantage  of  retaining  all  the  features 
of  a  RSM  central-composite  design  as  well  as  cdcFng  1  ability  to  the  experimental 
data  points  because  the  collcpsed  data  are  nor  heavily  erased  by  the  results  of 
cny  one  extreme  subject.  This  is  especially  valid  if  the  median  is  used  as  the 
combining  statistic.  Since  it  is  probably  of  little  value  to  develop  unique  prediction 
equations  for  each  subject,  such  a  collapsing  procedure  ma7  be  appropriate  even 
though  degrees  of  freedom  ere  last  from  the  design. 

Noncollapsed  Designs 

Suppose,  however,  that  the  investigator  replicating  the  blocked  central- 
composite  design  chooses  not  to  collcpse  his  data  across  subjects.  Rather,  he 
retains  each  of  the  subject's  data  for  subsequent  analysis.  By  retaining  all  this 
information  he  gains  degrees  of  freedom  for  the  error  term  which  were  previously 
lost  by  collapsing  the  data.  Error  is  now  estimated  across  all  points  at  which 
replications  occur,  instead  of  using  only  the  estimate  of  the  error  at  the  center 
point.  It  is  quite  possible  that  there  may  be  certain  areas  of  the  experimental 
region  in  which  there  is  considerable  vai ’ability  in  response  and  other  areas  in 
which  the  variability  is  negligible.  This  is  particularly  true  if  the  range  of  factor 
levels  under  consideration  is  sizable.  Given  this  variability,  it  is  not  reasonable 
to  use  the  estimate  of  error  at  only  one  such  area  as  an  estimate  of  error  throughout 
the  experimental  region.  The  prediction  equation  which  one  develops  should  afford 
a  reasonable  description  of  the  entire  response  surface,  not  merely  a  selected  area 
of  that  response  surface. 

When  noncollapsed  designs  are  used,  the  investigator  must  make  another 
major  decision  with  respect  to  his  selected  design.  If,  due  to  the  nature  of  his 
research  problem,  he  chooses  to  observe  different  subjects  at  each  of  the  experimental 
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points,  the  resulting  study  constitutes  a  between-subjects  design.  If,  on  the  other 
hand,  he  elects  to  observe  each  of  c  set  of  subjects  under  all  experimental  conditions, 
the  resulting  study  constitutes  a  within-subject  design.  The  choice  of  a  beiween- 
versus  a  within-subject  design  is  dictated  by  the  particular  question  which  the 
researcher  is  investigating.  In  either  case,  if  the  necessary  restrictions  ore  observed, 
the  design  conforms  to  the  basic  central-composite  design. 

Between-Subjects  Designs.  Given  certain  research  questions,  observing  the 
same  subjects  under  more  than  one  experimental  condition  would  lead  one  to  draw 
invalid  conclusions  concerning  the  effect  of  the  various  experimental  manipulations. 
Consider,  for  example,  an  investigation  of  the  comparative  efficacy  of  selected 
training  methods.  Certainly  Training  Method  B  cannot  be  evaluated  accurately  by 
observing  the  performance  of  subjects  who  have  previously  been  trained  to  criterion 
under  Method  A,  because  the  observed  performance  may  be  a  function  of  not  only 
the  condition  itself,  but  also  of  the  preceding  condition  which  he  has  experienced. 

In  such  a  case  it  is  imperative  that  the  investigator  adopt  a  befween-subjects  design, 
observing  each  subject  under  only  one  experimental  condition. 

Recall  the  detection  latency  study  which  replicates  the  orthogonally  blocked 
central-composite  design  of  Figure  2  five  times.  If  100  different  subjects  are  observed 
across  those  20  data  points  (six  of  v/hich  are  the  center  point),  a  between-subjects 
design  is  realized.  Since  the  full  central -composite  design  is  being  replicated  in 
tact,  the  necessary  relationship  guaranteeing  orthogonal  blocking,  as  given  in 
Equation  1, is  still  satisfied.  As  in  the  original  design  the  center  point  is  being 
replicated  six  times  more  than  any  other  point.  Although  experimental  error  is  now 
being  estimated  across  all  data  points,  the  results  of  a  subsequent  analysis  to 
determine  a  first  order  prediction  equation  are  of  the  same  type  shown  in  Table  2. 

The  increased  number  of  observations  is  reflected  in  the  values  for  total  degrees  of 
freedom,  residual  degrees  of  freedom,  and  error  degrees  of  freedom;  the  adjusted 
values  are  99,  96,  and  83,  respectively.  Meyer  (1963)  has  used  this  design  procedure 
successfully  in  a  human  learning  experiment. 

If,  indeed,  the  variability  in  response  at  each  of  a  series  of  data  points  is 
used  as  an  estimate  of  experimental  error  variance,  there  is  no  need  to  replicate 
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one  point  more  then  any  other.  In  the  original  central-composite  design,  in 
which  only  the  center  point  is  replicated,  the  additional  observations  at  that  point 
provide  the  investigator  v/ith  his  only  estimate  of  error.  But,  with  repeated  ob¬ 
servations  occurring  at  each  of  the  experimental  points,  there  appears  no  need  to 
make  more  observations  at  the  center.  The  investigator  could  choose  instead  to 
replicate  eacl  of  the  experimental  points,  including  the  center,  an  equal  number 
of  times,  while  still  maintaining  the  use  of  different  subjects  for  each  observation. 

Eliminating  observations  at  the  center  point,  however,  has  implications  for 
orthogonal  blocking.  It  is  now  necessary  to  adjust  the  value  of  a  accordingly, 
because  the  original  blocking  has  been  disturbed  due  to  the  elimination  of  center 
points  from  the  factorial  portion  of  the  design  and  the  reduction  in  the  number  of 
center  points  in  the  star  portion  of  the  design.  With  respect  to  the  target  detection 
latency  example  in  which  repeated  observations  are  made  at  each  of  15  unique 
experimental  points,  making  the  appropriate  adjustment  results  in  an  a  value  of 
1  .87  rather  than  1 .633,  as  defined  by  Equation  ]  .  This  change  in  the  a  value  is 
reflected  in  Figure  3  which  designates  the  orthogonal  blocking  of  the  15  unique 
experimental  points.  Note  the  reduction  of  data  collection  points  within  each  of 
the  three  blocks,  and  the  complete  absence  of  center  points  in  Blocks  1  and  2. 

Changing  the  coded  value  of  a  also  necessitates  reselecting  the  real  world 
levels  of  the  various  factors  under  study.  Recalculating  the  levels  of  display 
resolution,  for  example,  the  investigator  learns  that  the  five  levels  appropriate 
to  the  new  orthogonally  blocked  design  are  149,  236,  336,  436,  and  523.  Selecting 
5i«e  levels  retains  the  center  of  the  experimental  region,  but  increases  its 
rc-!ge-  over  that  indicated  in  Figure  2. 

Replicating  this  modified  centra  I -composite  design  five  times,  the  in¬ 
vestigator  makes  a  total  of  75  observations,  20  in  Block  1,  20  in  Block  2,  and 
35  in  Block  3.  Submitting  these  75  observations  to  direct  analysis  to  determine 
a  first-order  prediction  equation  yields  results  similar  to  those  shown  in  Table  2. 
Again,  the  change  in  design  is  reflected  in  corresponding  changes  in  values  of 
total  degrees  of  freedom,  residual  degrees  of  freedom,  and  error  degrees  of 
freedom;  the  adjusted  values  are  74,  71,  and  60,  respectively. 


BLOCK  1 

BLOCK  2 

BLOCK  3 

<  i,  -i,D 

(-1,  i, i ) 

(-1.87,  0,0  ) 

<  1. 1, -i i 

(-1,-1, i  > 

(0, -1.87,0) 

M,  1,1) 

( i,-i,-n 

(0,  0 ,  -J.87) 

t  1,-1 , -1  ) 

(1,1,1/ 

(1.87,0, 0) 

(0,1.87 ,0) 

(  0,  0,  1.87  ) 

(0,  0,0) 

Figure  3.  Orthogonal  blocking  of  second-order  central -composite  design  in  three 
variables  with  coded  value  coordinates  of  data  points  employing  equal 
number  of  replications. 
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Within-Subject  Designs.  On  occasion  the  objectives  of  an  experiment  make 
it  appropriate  and  desirable  to  observe  each  subject  in  each  treatment  condition.  In 
such  a  case,  each  individual  serves  as  his  own  control ,  and  between-subject 
vcricbility  does  not  affect  the  experimental  conditions.  Moreover,  observing  the 
same  set  of  subjects  under  each  treatment  condition  affords  another  obvious  advantage 
over  the  befween-subjects  designs  in  that  fewer  subjects  are  needed  to  conduct  the 
study,  albeit  one  may  encounter  the  familiar  problem  of  subject  attrition.  Of  course, 
this  design  strategy  is  not  appropriate  when  a  subject's  performance  in  one  condition 
is  affected  by  prior  experience  with  any  of  the  other  conditions.  As  previously  mentioned, 
a  within-subject  design  is  inappropriate  for  studying  differential  training  effectiveness. 
However,  it  could  be  used  effectively  to  investigate  the  differential  suitability  of 
various  display  formats  to  enhance  target  detection  where  there  is  little  or  no 
transfer  from  display  to  display.  When  these  within-subject  designs  are  used,  caution 
must  be  exercised  to  implement  the  proper  counterbalancing  so  as  to  avoid  spurious 
order  effects. 

The  within-subject  design  combines  several  features  of  the  central-composite 
design  variations  previously  discussed.  Again,  a  check  should  be  made  to  insure 
that  the  selected  value  guarantees  orthogonality  in  the  case  of  blocked  designs,  or 
rotatability  in  the  case  of  unblocked  designs.  The  appropriate  real  world  levels 
of  the  experimental  factors  are  then  determined  accordingly.  Data  are  retained, 
uncollapsed  from  repeated  observations  made  at  each  of  the  experimental  points, 
thereby  affording  increased  degrees  of  freedom  for  the  resulting  error  term.  As 
in  the  other  design  variations,  the  within-subject  design  permits  tests  for  the 
significance  of  blocking  and  of  lack  of  fit  as  well  as  tests  of  individual  partial 
regression  coefficients.  In  addition,  a  subject  term  can  be  isolated  and  tested  foi 
significance.  Because  subjects  are  completely  crossed  with  treatments  (every 
subject  receives  every  treatment  once),  one  can  refine  his  estimate  of  experimental 
error  variance  by  accounting  for  the  variability  within  the  individual  subjects  after 
assessing  the  variability  within  treatment  conditions.  The  error  term  which  results 
from  merely  accounting  for  the  variability  of  response  at  the  experimental  points 
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irrespective  of  subject  is  confounded  by  an  interaction  between  subjects  and  the 
treatment  conditions.  By  removing  this  effect,  a  better  estimate  of  experimental 
errcr  is  available  for  subsequent  tests  for  significance  of  the  partial  regression 
weights  of  the  RSM  prediction  equations. 

By  way  of  example,  the  same  four  subjects  might  be  observed  at  each  of  the 
15  experimental  points  designated  in  Figure  3,  thereby  yielding  a  total  of  60 
observations.  Hypothetical  data  for  such  a  design  are  presented  in  Table  3. 

Note  that  the  1 .87  value  fora  is  still  appropriate  because  all  15  points,  including 
the  center  point,  are  being  replicated  an  equal  number  of  times.  A  multiple  regression 
analysis  of  these  hypothetical  data  yields  the  following  first  order  prediction  equation: 
Y  =  16.44  -  1 .16751591  X]  -  0.39631381  X2  +  0.82118942  X3 

Substituting  given  levels  of  display  resolution,  visual  angle,  and  random  noise  for 
X^,  X^,  and  X^,  respectively,  into  this  equation  provides  a  corresponding  predicted 
level  of  detection  latency. 

The  results  of  a  subsequent  ANOVA  performed  on  the  hypothetical  data 
of  the  regression  analysis  appear  in  Table  4.  Note  the  additional  "subjects" 
component  into  which  residual  variance  has  been  subdivided.  The  corresponding 
degrees  of  freedom  reflect  the  use  of  four  subjects  throughout  the  experiment. 

Notice  also  that  the  error  degrees  of  freedom  are  reduced  by  3,  the  degrees  of 
freedom  attributed  to  the  subject  factor.  Had  this  experiment  utilized  different 
subjects  throughout,  the  value  of  error  degrees  of  freedom  would  have  been  45 
rather  than  42.  But,  in  the  case  of  within-subject  designs,  the  error  term  is 
purified  by  removing  the  subject  effect  from  it. 

CONCLUSIONS 

The  techniques  of  RSM,  and  the  central -composite  design  in  particular, 
can  be  effectively  used  in  human  factors  research,  where  the  goal  is  frequently 
the  development  of  an  equation  to  describe  the  relationship  between  human  per¬ 
formance  and  a  host  of  equipment  parameters.  Certain  modifications  in  the  basic 
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TABLE  3 


Hypothetical  Data  in  Coded  Form  For  A  Three-Factor,  Central-Composite, 
RSM  Design  Using  Repeated  Measures  on  Four  Subjects 


Defection  Latency  (Seconds) 
For  Four  Subjects 


Resolution 

Visual  Angle 

Random  Noise 

^2 

^3 

^4 

1.0 

-1.0 

1 .0 

15.8 

15.9 

16.1 

16.4 

1.0 

1.0 

-1 .0 

14.3 

14.5 

14.0 

14.8 

-1 .0 

1.0 

1.0 

17.0 

17.3 

17.1 

16.9 

-1 .0 

-1.0 

-1.0 

17.4 

17.5 

17.0 

17.3 

-1  .0 

1.0 

-1.0 

16.8 

16.7 

17.0 

17.0 

-1  .0 

-1.0 

1.0 

18.1 

18.3 

18.6 

18.1 

1.0 

-1.0 

-1 .0 

14.9 

15.2 

14.5 

15.0 

1.0 

1.0 

1 .0 

16.2 

16.7 

16.4 

15.9 

-1.87 

o 

• 

o 

0.0 

19.0 

19.1 

18.9 

19.5 

0.0 

-1.87 

0.0 

17.3 

16.9 

17.4 

16.8 

0.0 

0.0 

-1 .87 

15.1 

15.3 

14.4 

15.0 

1.87 

0.0 

0.0 

13.9 

14.2 

13.7 

14.1 

0.0 

1.87 

0.0 

14.9 

15.0 

14.8 

15.0 

0.0 

0.0 

1  .87 

19.2 

19.0 

20.0 

18.9 

0.0 

0.0 

0.0 

15.8 

16.1 

16.4 

16.0 

-  - . .  . 
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TAB1.E  4 


First-Order  Regression  Analysis  of  Variance  Summary  Table 
for  Hypothetical  Detection  Latency  Data  of  Four  Subjects 


Source 

Percent 

£ 

Sum  of  Squares 

Mean  Square 

F-Value 

Regression 

(84.98) 

(  3) 

131.62 

43.87 

548.37*** 

bl 

52.78 

1 

81.75 

81.75 

1021.87*** 

b2 

6.08 

1 

9.42 

9.42 

117.75  *** 

b3 

26.11 

1 

40.44 

40.44 

505.50  *** 

Residual 

(15.02) 

(56) 

23.27 

0.42 

Blocks 

0.83 

2 

1.29 

0.65 

8.13  ** 

Subjects 

0.09 

3 

0.14 

0.05 

0.63 

Lack  of  Fit 

11.84 

9 

18.34 

2.04 

25.50*** 

Error 

2.26 

42 

3.50 

0.08 

Total 

100.00 

59 

154.88 

**  p  <  .01 

***  p  <  .001 


Multiple  Regression  Coefficient  =  0.92 
Coefficient  of  Determination  =0.85 
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cenfrcl -composite  design,  bov/ever,  appear  to  make  the  method  more  appropriate 
to  research  involving  human  subjects .  In  making  the  appropriate  design  modifi¬ 
cations,  the  investigator  must  make  several  major  decisions.  He  must  decide  whether 
or  nor  to  make  repeated  observations  over  a  series  of  experimental  points  rarher  than 
at  a  single  point.  If  his  goal  is  to  develop  a  global  prediction  equation  to  approxi¬ 
mate  the  entire  response  surface,  replications  at  each  of  the  experimental  data- 
coilecrion  points  appears  to  be  a  wise  strategy .  The  bajic  central-composite  design, 
calling  ter  replications  at  only  the  center  paint,  is  perhaps  better  reserved  for 
preliminary  research  where  the  primary  aim  is  to  ascertain  quickly  what  major 
factors  appear  worthy  of  more  thorough  study . 

The  investigator  must  also  select  either  a  befween-subiects  or  a  wifhin- 
subject  design.  This  choice  is  dictated  by  the  objectives  of  his  particular  experiment. 
Of  the  design  variants  discussed  above,  those  advocating  multiple  and  equal 
replications  at  all  experimental  points,  followed  by  analysis  of  uncollapsed  data, 
appear  the  most  advantageous,  whether  they  are  conceived  as  between-  or  v/ithin- 
subject  designs.  The  particular  modifications  which  the  investigator  elects  to 
implement  have  ramifications  for  other  aspects  of  the  design  such  as  orthogonal 
blocking  and  rotatability .  Appropriate  adjustments  must  be  made  in  factor  level 
selection  in  order  to  retain  such  attributes  in  view  of  the  overall  design  modification. 
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